A mask-based enhancement method for historical documents

نویسندگان

  • Elisa H. Barney Smith
  • Jérôme Darbon
  • Laurence Likforman-Sulem
چکیده

This paper proposes a novel method for document enhancement. The method is based on the combination of two state-of-the-art filters through the construction of a mask. The mask is applied to a TV (Total Variation) regularized image where background noise has been reduced. The masked image is then filtered by NLmeans (Non Local Means) which reduces the noise in the text areas located by the mask. The document images to be enhanced are real historical documents from several periods which include several defects in their background. These defects result from scanning, paper aging and bleed-through. We observe the improvement of this enhancement method through OCR accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enhancement of historical printed document images by combining Total Variation regularization and Non-local Means filtering

This paper proposes a novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step is based on the total variation framework. It flattens background grey-levels and produces an intermediate image where background noise is considerably reduced. This image is used as a mask to produce an image with a cleaner background while keeping character deta...

متن کامل

A Novel Thresholding Method for Text Separation and Document Enhancement

Many thresholding-based image enhancement techniques have been developed and used for document analysis, where the simplicity and efficiency of thresholding makes it ideal to use for classifying layers within documents. However, the efficiency of these enhancement techniques can be impaired by the variation of grey levels in different documents, thus causing over-thresholding or under-threshold...

متن کامل

Degraded document image enhancement

Poor quality documents are obtained in various situations such as historical document collections, legal archives, security investigations, and documents found in clandestine locations. Such documents are often scanned for automated analysis, further processing, and archiving. Due to the nature of such documents, degraded document images are often hard to read, have low contrast, and are corrup...

متن کامل

The Development of "Naqsh-e Jahan" Square in Isfahan

Despite numerous studies regarding the development history of Naqsh-e Jahan Square, there are still many questions which have not been accurately answered to date. Some of them include the history of the square, the exact date of initiation and completion of construction of different elements of the square, and the order of their completion. This article tries to answer these questions accurate...

متن کامل

An Adaptive Method for Physical Documents Digitization based on Global Energy Function Parameter

The first step of physical document analysis system is to digitalize the physical document. Recently number of researcher present numerous techniques that can vary in sensitivity, quality and some more control parameter. This paper presents a three tier framework for physical document digitization and describes an automatic technique for document digitization that can significantly increase the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011